Dataset statistics
| Number of variables | 12 |
|---|---|
| Number of observations | 5840 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 547.6 KiB |
| Average record size in memory | 96.0 B |
Variable types
| Text | 1 |
|---|---|
| Categorical | 2 |
| Numeric | 8 |
| Boolean | 1 |
Grams has constant value "" | Constant |
Calories is highly overall correlated with Calories_per_Gram and 6 other fields | High correlation |
Calories_per_Gram is highly overall correlated with Calories and 6 other fields | High correlation |
Carbs is highly overall correlated with Calories and 2 other fields | High correlation |
Fat is highly overall correlated with Calories and 4 other fields | High correlation |
Fiber is highly overall correlated with Carbs | High correlation |
Is_low_calorie is highly overall correlated with Calories and 1 other fields | High correlation |
Protein is highly overall correlated with Calories and 4 other fields | High correlation |
Protein_Percent is highly overall correlated with Calories and 4 other fields | High correlation |
Sat.Fat is highly overall correlated with Calories and 4 other fields | High correlation |
Is_low_calorie is highly imbalanced (56.6%) | Imbalance |
Protein has 615 (10.5%) zeros | Zeros |
Fat has 1078 (18.5%) zeros | Zeros |
Sat.Fat has 1714 (29.3%) zeros | Zeros |
Fiber has 2163 (37.0%) zeros | Zeros |
Carbs has 682 (11.7%) zeros | Zeros |
Protein_Percent has 615 (10.5%) zeros | Zeros |
Reproduction
| Analysis started | 2023-12-10 07:10:36.465001 |
|---|---|
| Analysis finished | 2023-12-10 07:10:52.677313 |
| Duration | 16.21 seconds |
| Software version | ydata-profiling vv4.6.2 |
| Download configuration | config.json |
Food
Text
| Distinct | 5819 |
|---|---|
| Distinct (%) | 99.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 45.8 KiB |
Length
| Max length | 184 |
|---|---|
| Median length | 103 |
| Mean length | 34.73476 |
| Min length | 3 |
Characters and Unicode
| Total characters | 202851 |
|---|---|
| Distinct characters | 70 |
| Distinct categories | 8 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 5799 ? |
|---|---|
| Unique (%) | 99.3% |
Sample
| 1st row | Cows' milk |
|---|---|
| 2nd row | Milk skim |
| 3rd row | Buttermilk |
| 4th row | Evaporated, undiluted |
| 5th row | Fortified milk |
| Value | Count | Frequency (%) |
| with | 1298 | 4.1% |
| fat | 919 | 2.9% |
| or | 892 | 2.8% |
| and | 616 | 1.9% |
| cooked | 590 | 1.9% |
| added | 546 | 1.7% |
| as | 500 | 1.6% |
| to | 493 | 1.6% |
| ns | 484 | 1.5% |
| cheese | 412 | 1.3% |
| Other values (1664) | 24886 |
Most occurring characters
| Value | Count | Frequency (%) |
| 25810 | 12.7% | |
| e | 20923 | 10.3% |
| a | 15396 | 7.6% |
| o | 12668 | 6.2% |
| t | 11779 | 5.8% |
| r | 11259 | 5.6% |
| , | 10450 | 5.2% |
| d | 9982 | 4.9% |
| i | 9503 | 4.7% |
| n | 8423 | 4.2% |
| Other values (60) | 66658 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 156812 | |
| Space Separator | 25810 | 12.7% |
| Other Punctuation | 10890 | 5.4% |
| Uppercase Letter | 8192 | 4.0% |
| Dash Punctuation | 701 | 0.3% |
| Decimal Number | 230 | 0.1% |
| Close Punctuation | 108 | 0.1% |
| Open Punctuation | 108 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 20923 | |
| a | 15396 | 9.8% |
| o | 12668 | 8.1% |
| t | 11779 | 7.5% |
| r | 11259 | 7.2% |
| d | 9982 | 6.4% |
| i | 9503 | 6.1% |
| n | 8423 | 5.4% |
| s | 7825 | 5.0% |
| c | 6965 | 4.4% |
| Other values (16) | 42089 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1431 | |
| S | 1270 | |
| P | 985 | |
| N | 751 | |
| B | 599 | |
| F | 476 | 5.8% |
| R | 408 | 5.0% |
| T | 332 | 4.1% |
| M | 322 | 3.9% |
| G | 219 | 2.7% |
| Other values (16) | 1399 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 10450 | |
| / | 218 | 2.0% |
| ; | 113 | 1.0% |
| % | 63 | 0.6% |
| ' | 23 | 0.2% |
| " | 17 | 0.2% |
| . | 6 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 99 | |
| 1 | 75 | |
| 2 | 36 | 15.7% |
| 5 | 8 | 3.5% |
| 4 | 7 | 3.0% |
| 3 | 4 | 1.7% |
| 9 | 1 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| 25810 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 701 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 108 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 108 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 165004 | |
| Common | 37847 | 18.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 20923 | |
| a | 15396 | 9.3% |
| o | 12668 | 7.7% |
| t | 11779 | 7.1% |
| r | 11259 | 6.8% |
| d | 9982 | 6.0% |
| i | 9503 | 5.8% |
| n | 8423 | 5.1% |
| s | 7825 | 4.7% |
| c | 6965 | 4.2% |
| Other values (42) | 50281 |
Common
| Value | Count | Frequency (%) |
| 25810 | ||
| , | 10450 | |
| - | 701 | 1.9% |
| / | 218 | 0.6% |
| ; | 113 | 0.3% |
| ) | 108 | 0.3% |
| ( | 108 | 0.3% |
| 0 | 99 | 0.3% |
| 1 | 75 | 0.2% |
| % | 63 | 0.2% |
| Other values (8) | 102 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 202851 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 25810 | 12.7% | |
| e | 20923 | 10.3% |
| a | 15396 | 7.6% |
| o | 12668 | 6.2% |
| t | 11779 | 5.8% |
| r | 11259 | 5.6% |
| , | 10450 | 5.2% |
| d | 9982 | 4.9% |
| i | 9503 | 4.7% |
| n | 8423 | 4.2% |
| Other values (60) | 66658 |
Grams
Categorical
CONSTANT 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 45.8 KiB |
| 100 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 17520 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 100 |
|---|---|
| 2nd row | 100 |
| 3rd row | 100 |
| 4th row | 100 |
| 5th row | 100 |
Common Values
| Value | Count | Frequency (%) |
| 100 | 5840 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 100 | 5840 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 11680 | |
| 1 | 5840 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 17520 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 11680 | |
| 1 | 5840 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 17520 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 11680 | |
| 1 | 5840 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17520 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 11680 | |
| 1 | 5840 |
Calories
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 617 |
|---|---|
| Distinct (%) | 10.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 201.87295 |
| Minimum | 0 |
|---|---|
| Maximum | 902 |
| Zeros | 37 |
| Zeros (%) | 0.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 45.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 27 |
| Q1 | 79 |
| median | 166.5 |
| Q3 | 282 |
| 95-th percentile | 497 |
| Maximum | 902 |
| Range | 902 |
| Interquartile range (IQR) | 203 |
Descriptive statistics
| Standard deviation | 152.3006 |
|---|---|
| Coefficient of variation (CV) | 0.75443789 |
| Kurtosis | 1.6211788 |
| Mean | 201.87295 |
| Median Absolute Deviation (MAD) | 99.5 |
| Skewness | 1.1667981 |
| Sum | 1178938 |
| Variance | 23195.472 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 50 | 41 | 0.7% |
| 65 | 40 | 0.7% |
| 0 | 37 | 0.6% |
| 51 | 35 | 0.6% |
| 64 | 33 | 0.6% |
| 134 | 33 | 0.6% |
| 48 | 33 | 0.6% |
| 62 | 32 | 0.5% |
| 127 | 32 | 0.5% |
| 44 | 31 | 0.5% |
| Other values (607) | 5493 |
| Value | Count | Frequency (%) |
| 0 | 37 | |
| 1 | 26 | |
| 2 | 12 | 0.2% |
| 3 | 11 | 0.2% |
| 4 | 9 | 0.2% |
| 5 | 14 | 0.2% |
| 6 | 7 | 0.1% |
| 7 | 5 | 0.1% |
| 8 | 4 | 0.1% |
| 9 | 4 | 0.1% |
| Value | Count | Frequency (%) |
| 902 | 1 | < 0.1% |
| 900 | 16 | |
| 896 | 1 | < 0.1% |
| 895 | 1 | < 0.1% |
| 893 | 3 | 0.1% |
| 822 | 1 | < 0.1% |
| 767 | 1 | < 0.1% |
| 748 | 2 | < 0.1% |
| 745 | 1 | < 0.1% |
| 740 | 2 | < 0.1% |
Protein
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 58 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.6232877 |
| Minimum | -2 |
|---|---|
| Maximum | 102 |
| Zeros | 615 |
| Zeros (%) | 10.5% |
| Negative | 1 |
| Negative (%) | < 0.1% |
| Memory size | 45.8 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 5 |
| Q3 | 12 |
| 95-th percentile | 26 |
| Maximum | 102 |
| Range | 104 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 8.9967096 |
|---|---|
| Coefficient of variation (CV) | 1.0433039 |
| Kurtosis | 7.1642142 |
| Mean | 8.6232877 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 1.8411842 |
| Sum | 50360 |
| Variance | 80.940783 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 621 | 10.6% |
| 0 | 615 | 10.5% |
| 1 | 567 | 9.7% |
| 3 | 494 | 8.5% |
| 4 | 334 | 5.7% |
| 5 | 300 | 5.1% |
| 6 | 243 | 4.2% |
| 7 | 239 | 4.1% |
| 8 | 235 | 4.0% |
| 9 | 231 | 4.0% |
| Other values (48) | 1961 |
| Value | Count | Frequency (%) |
| -2 | 1 | < 0.1% |
| 0 | 615 | |
| 1 | 567 | |
| 2 | 621 | |
| 3 | 494 | |
| 4 | 334 | |
| 5 | 300 | |
| 6 | 243 | 4.2% |
| 7 | 239 | 4.1% |
| 8 | 235 | 4.0% |
| Value | Count | Frequency (%) |
| 102 | 1 | |
| 101 | 1 | |
| 78 | 2 | |
| 76 | 1 | |
| 64 | 1 | |
| 63 | 2 | |
| 61 | 1 | |
| 59 | 1 | |
| 57 | 1 | |
| 56 | 2 |
Fat
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 80 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.2383562 |
| Minimum | 0 |
|---|---|
| Maximum | 103 |
| Zeros | 1078 |
| Zeros (%) | 18.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 45.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 5 |
| Q3 | 13 |
| 95-th percentile | 30 |
| Maximum | 103 |
| Range | 103 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 12.456784 |
|---|---|
| Coefficient of variation (CV) | 1.3483767 |
| Kurtosis | 16.313757 |
| Mean | 9.2383562 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 3.3249942 |
| Sum | 53952 |
| Variance | 155.17147 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1078 | |
| 3 | 510 | 8.7% |
| 1 | 422 | 7.2% |
| 2 | 380 | 6.5% |
| 4 | 354 | 6.1% |
| 5 | 256 | 4.4% |
| 7 | 229 | 3.9% |
| 8 | 209 | 3.6% |
| 6 | 205 | 3.5% |
| 11 | 196 | 3.4% |
| Other values (70) | 2001 |
| Value | Count | Frequency (%) |
| 0 | 1078 | |
| 1 | 422 | 7.2% |
| 2 | 380 | 6.5% |
| 3 | 510 | |
| 4 | 354 | 6.1% |
| 5 | 256 | 4.4% |
| 6 | 205 | 3.5% |
| 7 | 229 | 3.9% |
| 8 | 209 | 3.6% |
| 9 | 165 | 2.8% |
| Value | Count | Frequency (%) |
| 103 | 1 | < 0.1% |
| 101 | 1 | < 0.1% |
| 100 | 21 | |
| 99 | 2 | < 0.1% |
| 82 | 2 | < 0.1% |
| 81 | 6 | 0.1% |
| 80 | 1 | < 0.1% |
| 79 | 2 | < 0.1% |
| 78 | 3 | 0.1% |
| 76 | 1 | < 0.1% |
Sat.Fat
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 54 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.0695205 |
| Minimum | 0 |
|---|---|
| Maximum | 104 |
| Zeros | 1714 |
| Zeros (%) | 29.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 45.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 11 |
| Maximum | 104 |
| Range | 104 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 5.6683739 |
|---|---|
| Coefficient of variation (CV) | 1.8466643 |
| Kurtosis | 76.827994 |
| Mean | 3.0695205 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 6.7468791 |
| Sum | 17926 |
| Variance | 32.130463 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1714 | |
| 1 | 1352 | |
| 2 | 701 | |
| 3 | 541 | 9.3% |
| 4 | 373 | 6.4% |
| 5 | 288 | 4.9% |
| 6 | 172 | 2.9% |
| 7 | 105 | 1.8% |
| 8 | 105 | 1.8% |
| 9 | 96 | 1.6% |
| Other values (44) | 393 | 6.7% |
| Value | Count | Frequency (%) |
| 0 | 1714 | |
| 1 | 1352 | |
| 2 | 701 | |
| 3 | 541 | 9.3% |
| 4 | 373 | 6.4% |
| 5 | 288 | 4.9% |
| 6 | 172 | 2.9% |
| 7 | 105 | 1.8% |
| 8 | 105 | 1.8% |
| 9 | 96 | 1.6% |
| Value | Count | Frequency (%) |
| 104 | 1 | < 0.1% |
| 102 | 1 | < 0.1% |
| 88 | 1 | < 0.1% |
| 84 | 1 | < 0.1% |
| 82 | 1 | < 0.1% |
| 71 | 1 | < 0.1% |
| 68 | 1 | < 0.1% |
| 64 | 1 | < 0.1% |
| 62 | 1 | < 0.1% |
| 51 | 3 |
Fiber
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 28 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.6907534 |
| Minimum | 0 |
|---|---|
| Maximum | 104 |
| Zeros | 2163 |
| Zeros (%) | 37.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 45.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 6 |
| Maximum | 104 |
| Range | 104 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 3.1237945 |
|---|---|
| Coefficient of variation (CV) | 1.8475755 |
| Kurtosis | 395.69037 |
| Mean | 1.6907534 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 14.074437 |
| Sum | 9874 |
| Variance | 9.7580923 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2163 | |
| 1 | 1416 | |
| 2 | 1098 | |
| 3 | 455 | 7.8% |
| 4 | 262 | 4.5% |
| 6 | 98 | 1.7% |
| 5 | 94 | 1.6% |
| 7 | 59 | 1.0% |
| 10 | 56 | 1.0% |
| 8 | 48 | 0.8% |
| Other values (18) | 91 | 1.6% |
| Value | Count | Frequency (%) |
| 0 | 2163 | |
| 1 | 1416 | |
| 2 | 1098 | |
| 3 | 455 | 7.8% |
| 4 | 262 | 4.5% |
| 5 | 94 | 1.6% |
| 6 | 98 | 1.7% |
| 7 | 59 | 1.0% |
| 8 | 48 | 0.8% |
| 9 | 29 | 0.5% |
| Value | Count | Frequency (%) |
| 104 | 1 | < 0.1% |
| 102 | 1 | < 0.1% |
| 43 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 37 | 1 | < 0.1% |
| 29 | 1 | < 0.1% |
| 27 | 3 | |
| 23 | 1 | < 0.1% |
| 20 | 2 | |
| 19 | 2 |
Carbs
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 104 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 21.15 |
| Minimum | 0 |
|---|---|
| Maximum | 128 |
| Zeros | 682 |
| Zeros (%) | 11.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 45.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 5 |
| median | 13 |
| Q3 | 29 |
| 95-th percentile | 70 |
| Maximum | 128 |
| Range | 128 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 21.97581 |
|---|---|
| Coefficient of variation (CV) | 1.0390454 |
| Kurtosis | 1.1579573 |
| Mean | 21.15 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 1.3731536 |
| Sum | 123516 |
| Variance | 482.93622 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 682 | 11.7% |
| 13 | 237 | 4.1% |
| 4 | 214 | 3.7% |
| 10 | 202 | 3.5% |
| 5 | 187 | 3.2% |
| 6 | 184 | 3.2% |
| 12 | 175 | 3.0% |
| 1 | 164 | 2.8% |
| 8 | 163 | 2.8% |
| 16 | 161 | 2.8% |
| Other values (94) | 3471 |
| Value | Count | Frequency (%) |
| 0 | 682 | |
| 1 | 164 | 2.8% |
| 2 | 121 | 2.1% |
| 3 | 157 | 2.7% |
| 4 | 214 | 3.7% |
| 5 | 187 | 3.2% |
| 6 | 184 | 3.2% |
| 7 | 159 | 2.7% |
| 8 | 163 | 2.8% |
| 9 | 155 | 2.7% |
| Value | Count | Frequency (%) |
| 128 | 1 | < 0.1% |
| 105 | 1 | < 0.1% |
| 103 | 1 | < 0.1% |
| 100 | 9 | |
| 99 | 4 | |
| 98 | 6 | |
| 97 | 1 | < 0.1% |
| 96 | 1 | < 0.1% |
| 95 | 4 | |
| 94 | 1 | < 0.1% |
Category
Categorical
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 45.8 KiB |
| Miscellaneous | |
|---|---|
| Meat, Poultry | |
| Breads, cereals, fastfood,grains | |
| Dairy products | |
| Potato | 105 |
| Other values (4) |
Length
| Max length | 33 |
|---|---|
| Median length | 13 |
| Mean length | 15.694692 |
| Min length | 6 |
Characters and Unicode
| Total characters | 91657 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Dairy products |
|---|---|
| 2nd row | Dairy products |
| 3rd row | Dairy products |
| 4th row | Dairy products |
| 5th row | Dairy products |
Common Values
| Value | Count | Frequency (%) |
| Miscellaneous | 3072 | |
| Meat, Poultry | 966 | 16.5% |
| Breads, cereals, fastfood,grains | 897 | 15.4% |
| Dairy products | 481 | 8.2% |
| Potato | 105 | 1.8% |
| Cookie | 100 | 1.7% |
| Coffee | 91 | 1.6% |
| Vegetables | 71 | 1.2% |
| Fruits | 57 | 1.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| miscellaneous | 3072 | |
| meat | 966 | 10.6% |
| poultry | 966 | 10.6% |
| breads | 897 | 9.9% |
| cereals | 897 | 9.9% |
| fastfood,grains | 897 | 9.9% |
| dairy | 481 | 5.3% |
| products | 481 | 5.3% |
| potato | 105 | 1.2% |
| cookie | 100 | 1.1% |
| Other values (3) | 219 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 10341 | |
| e | 10296 | |
| a | 8283 | 9.0% |
| l | 8078 | 8.8% |
| o | 6814 | 7.4% |
| r | 4676 | 5.1% |
| i | 4607 | 5.0% |
| u | 4576 | 5.0% |
| c | 4450 | 4.9% |
| 4138 | 4.5% | |
| Other values (17) | 25398 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 77056 | |
| Uppercase Letter | 6806 | 7.4% |
| Space Separator | 4138 | 4.5% |
| Other Punctuation | 3657 | 4.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 10341 | |
| e | 10296 | |
| a | 8283 | |
| l | 8078 | |
| o | 6814 | |
| r | 4676 | |
| i | 4607 | |
| u | 4576 | |
| c | 4450 | |
| n | 3969 | 5.2% |
| Other values (8) | 10966 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 4038 | |
| P | 1071 | 15.7% |
| B | 897 | 13.2% |
| D | 481 | 7.1% |
| C | 191 | 2.8% |
| V | 71 | 1.0% |
| F | 57 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| 4138 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3657 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 83862 | |
| Common | 7795 | 8.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 10341 | |
| e | 10296 | |
| a | 8283 | |
| l | 8078 | |
| o | 6814 | 8.1% |
| r | 4676 | 5.6% |
| i | 4607 | 5.5% |
| u | 4576 | 5.5% |
| c | 4450 | 5.3% |
| M | 4038 | 4.8% |
| Other values (15) | 17703 |
Common
| Value | Count | Frequency (%) |
| 4138 | ||
| , | 3657 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 91657 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 10341 | |
| e | 10296 | |
| a | 8283 | 9.0% |
| l | 8078 | 8.8% |
| o | 6814 | 7.4% |
| r | 4676 | 5.1% |
| i | 4607 | 5.0% |
| u | 4576 | 5.0% |
| c | 4450 | 4.9% |
| 4138 | 4.5% | |
| Other values (17) | 25398 |
Calories_per_Gram
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 617 |
|---|---|
| Distinct (%) | 10.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.0187295 |
| Minimum | 0 |
|---|---|
| Maximum | 9.02 |
| Zeros | 37 |
| Zeros (%) | 0.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 45.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.27 |
| Q1 | 0.79 |
| median | 1.665 |
| Q3 | 2.82 |
| 95-th percentile | 4.97 |
| Maximum | 9.02 |
| Range | 9.02 |
| Interquartile range (IQR) | 2.03 |
Descriptive statistics
| Standard deviation | 1.523006 |
|---|---|
| Coefficient of variation (CV) | 0.75443789 |
| Kurtosis | 1.6211788 |
| Mean | 2.0187295 |
| Median Absolute Deviation (MAD) | 0.995 |
| Skewness | 1.1667981 |
| Sum | 11789.38 |
| Variance | 2.3195472 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.5 | 41 | 0.7% |
| 0.65 | 40 | 0.7% |
| 0 | 37 | 0.6% |
| 0.51 | 35 | 0.6% |
| 0.64 | 33 | 0.6% |
| 1.34 | 33 | 0.6% |
| 0.48 | 33 | 0.6% |
| 0.62 | 32 | 0.5% |
| 1.27 | 32 | 0.5% |
| 0.44 | 31 | 0.5% |
| Other values (607) | 5493 |
| Value | Count | Frequency (%) |
| 0 | 37 | |
| 0.01 | 26 | |
| 0.02 | 12 | 0.2% |
| 0.03 | 11 | 0.2% |
| 0.04 | 9 | 0.2% |
| 0.05 | 14 | 0.2% |
| 0.06 | 7 | 0.1% |
| 0.07 | 5 | 0.1% |
| 0.08 | 4 | 0.1% |
| 0.09 | 4 | 0.1% |
| Value | Count | Frequency (%) |
| 9.02 | 1 | < 0.1% |
| 9 | 16 | |
| 8.96 | 1 | < 0.1% |
| 8.95 | 1 | < 0.1% |
| 8.93 | 3 | 0.1% |
| 8.22 | 1 | < 0.1% |
| 7.67 | 1 | < 0.1% |
| 7.48 | 2 | < 0.1% |
| 7.45 | 1 | < 0.1% |
| 7.4 | 2 | < 0.1% |
Protein_Percent
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 58 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.6232877 |
| Minimum | -2 |
|---|---|
| Maximum | 102 |
| Zeros | 615 |
| Zeros (%) | 10.5% |
| Negative | 1 |
| Negative (%) | < 0.1% |
| Memory size | 45.8 KiB |
Quantile statistics
| Minimum | -2 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 5 |
| Q3 | 12 |
| 95-th percentile | 26 |
| Maximum | 102 |
| Range | 104 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 8.9967096 |
|---|---|
| Coefficient of variation (CV) | 1.0433039 |
| Kurtosis | 7.1642142 |
| Mean | 8.6232877 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 1.8411842 |
| Sum | 50360 |
| Variance | 80.940783 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 621 | 10.6% |
| 0 | 615 | 10.5% |
| 1 | 567 | 9.7% |
| 3 | 494 | 8.5% |
| 4 | 334 | 5.7% |
| 5 | 300 | 5.1% |
| 6 | 243 | 4.2% |
| 7 | 239 | 4.1% |
| 8 | 235 | 4.0% |
| 9 | 231 | 4.0% |
| Other values (48) | 1961 |
| Value | Count | Frequency (%) |
| -2 | 1 | < 0.1% |
| 0 | 615 | |
| 1 | 567 | |
| 2 | 621 | |
| 3 | 494 | |
| 4 | 334 | |
| 5 | 300 | |
| 6 | 243 | 4.2% |
| 7 | 239 | 4.1% |
| 8 | 235 | 4.0% |
| Value | Count | Frequency (%) |
| 102 | 1 | |
| 101 | 1 | |
| 78 | 2 | |
| 76 | 1 | |
| 64 | 1 | |
| 63 | 2 | |
| 61 | 1 | |
| 59 | 1 | |
| 57 | 1 | |
| 56 | 2 |
Is_low_calorie
Boolean
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.8 KiB |
| False | |
|---|---|
| True | 521 |
| Value | Count | Frequency (%) |
| False | 5319 | |
| True | 521 | 8.9% |
| Calories | Calories_per_Gram | Carbs | Category | Fat | Fiber | Is_low_calorie | Protein | Protein_Percent | Sat.Fat | |
|---|---|---|---|---|---|---|---|---|---|---|
| Calories | 1.000 | 1.000 | 0.547 | 0.210 | 0.805 | 0.261 | 0.503 | 0.515 | 0.515 | 0.718 |
| Calories_per_Gram | 1.000 | 1.000 | 0.547 | 0.210 | 0.805 | 0.261 | 0.503 | 0.515 | 0.515 | 0.718 |
| Carbs | 0.547 | 0.547 | 1.000 | 0.221 | 0.169 | 0.582 | 0.335 | -0.107 | -0.107 | 0.117 |
| Category | 0.210 | 0.210 | 0.221 | 1.000 | -0.139 | 0.023 | 0.350 | -0.186 | -0.186 | -0.129 |
| Fat | 0.805 | 0.805 | 0.169 | -0.139 | 1.000 | 0.119 | 0.210 | 0.557 | 0.557 | 0.921 |
| Fiber | 0.261 | 0.261 | 0.582 | 0.023 | 0.119 | 1.000 | 0.014 | -0.036 | -0.036 | 0.023 |
| Is_low_calorie | 0.503 | 0.503 | 0.335 | 0.350 | 0.210 | 0.014 | 1.000 | -0.396 | -0.396 | -0.379 |
| Protein | 0.515 | 0.515 | -0.107 | -0.186 | 0.557 | -0.036 | -0.396 | 1.000 | 1.000 | 0.502 |
| Protein_Percent | 0.515 | 0.515 | -0.107 | -0.186 | 0.557 | -0.036 | -0.396 | 1.000 | 1.000 | 0.502 |
| Sat.Fat | 0.718 | 0.718 | 0.117 | -0.129 | 0.921 | 0.023 | -0.379 | 0.502 | 0.502 | 1.000 |
| Food | Grams | Calories | Protein | Fat | Sat.Fat | Fiber | Carbs | Category | Calories_per_Gram | Protein_Percent | Is_low_calorie | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Cows' milk | 100 | 68 | 3 | 4 | 4 | 0 | 5 | Dairy products | 0.68 | 3 | No |
| 1 | Milk skim | 100 | 37 | 4 | 0 | 0 | 0 | 5 | Dairy products | 0.37 | 4 | Yes |
| 2 | Buttermilk | 100 | 52 | 4 | 2 | 2 | 0 | 5 | Dairy products | 0.52 | 4 | No |
| 3 | Evaporated, undiluted | 100 | 137 | 6 | 8 | 7 | 0 | 10 | Dairy products | 1.37 | 6 | No |
| 4 | Fortified milk | 100 | 97 | 6 | 3 | 2 | 0 | 8 | Dairy products | 0.97 | 6 | No |
| 5 | Powdered milk | 100 | 500 | 26 | 27 | 23 | 0 | 38 | Dairy products | 5.00 | 26 | No |
| 6 | skim, instant | 100 | 341 | 35 | 0 | 0 | 0 | 49 | Dairy products | 3.41 | 35 | No |
| 7 | skim, non-instant | 100 | 341 | 35 | 0 | 0 | 1 | 49 | Dairy products | 3.41 | 35 | No |
| 8 | Goats' milk | 100 | 68 | 3 | 4 | 3 | 0 | 5 | Dairy products | 0.68 | 3 | No |
| 9 | (1/2 cup ice cream) | 100 | 128 | 4 | 4 | 4 | 0 | 13 | Dairy products | 1.28 | 4 | No |
| Food | Grams | Calories | Protein | Fat | Sat.Fat | Fiber | Carbs | Category | Calories_per_Gram | Protein_Percent | Is_low_calorie | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5830 | Tomatoes, cooked, as ingredient | 100 | 26 | 1 | 0 | 0 | 2 | 5 | Miscellaneous | 0.26 | 1 | Yes |
| 5831 | Onions, cooked, as ingredient | 100 | 50 | 1 | 0 | 0 | 2 | 11 | Miscellaneous | 0.50 | 1 | No |
| 5832 | Mushrooms, cooked, as ingredient | 100 | 36 | 4 | 0 | 0 | 1 | 4 | Miscellaneous | 0.36 | 4 | Yes |
| 5833 | Green pepper, cooked, as ingredient | 100 | 25 | 1 | 0 | 0 | 2 | 5 | Miscellaneous | 0.25 | 1 | Yes |
| 5834 | Red pepper, cooked, as ingredient | 100 | 33 | 1 | 0 | 0 | 2 | 6 | Miscellaneous | 0.33 | 1 | Yes |
| 5835 | Cabbage, cooked, as ingredient | 100 | 30 | 1 | 0 | 0 | 3 | 6 | Miscellaneous | 0.30 | 1 | Yes |
| 5836 | Cauliflower, cooked, as ingredient | 100 | 31 | 2 | 0 | 0 | 2 | 5 | Miscellaneous | 0.31 | 2 | Yes |
| 5837 | Eggplant, cooked, as ingredient | 100 | 31 | 1 | 0 | 0 | 3 | 6 | Meat, Poultry | 0.31 | 1 | Yes |
| 5838 | Green beans, cooked, as ingredient | 100 | 39 | 2 | 0 | 0 | 3 | 7 | Miscellaneous | 0.39 | 2 | Yes |
| 5839 | Summer squash, cooked, as ingredient | 100 | 25 | 1 | 0 | 0 | 1 | 4 | Miscellaneous | 0.25 | 1 | Yes |